Classifying Microarray Data Using Pairwise Similarity between Gene Profiles
نویسندگان
چکیده
Figure 1: Dendrogram with a threshold applied (---) to create clusters. The first cluster has genes A and B while the second one has genes C, D, and E. genes to be measured simultaneously. However, interpretation of the large data sets is complex. Commonly used methods include hierarchical clustering and k-nearest-neighbor [3]. After applying these methods, clusters of co-expressed genes are produced, which can then be used as a fingerprint to identify a biological response. The basis of any gene clustering algorithm is the similarity (or dissimilarity) measure between gene expression profiles. After every possible gene-pair has been considered, hierarchical clustering recursively pairs the genes in order of increasing dissimilarity. A new score is assigned to a gene group which is obtained by aggregating the scores of the genes within the group [2]. Thus, this process resembles bottom-up creation of a binary tree, resulting in a dendrogram as depicted in Figure 1. By applying a threshold, the tree is cut horizontally to form subtrees, each representing a cluster. Hierarchical clustering has the advantage of allowing a user to visualize how genes merge to form clusters. However, individual relationships between genes within a cluster is lost once the cluster is formed. For example, gene E in the example may be more similar to gene C, but not gene D. In this poster, we focus on the pairwise nature of the hierarchical clustering process and propose a method of microarray classification using gene pairs instead of single genes or gene clusters.
منابع مشابه
Multiple gene expression profile alignment for microarray time-series data clustering
MOTIVATION Clustering gene expression data given in terms of time-series is a challenging problem that imposes its own particular constraints. Traditional clustering methods based on conventional similarity measures are not always suitable for clustering time-series data. A few methods have been proposed recently for clustering microarray time-series, which take the temporal dimension of the da...
متن کاملGene Microarray Analysis Using Angular Distribution Decomposition
Clustering techniques have been widely used in the analysis of microarray data to group genes with similar expression profiles. The similarity of expression profiles and hence the results of clustering greatly depend on how the data has been transformed. We present a method that uses the relative expression changes between pairs of conditions and an angular transformation to define the similari...
متن کاملExploiting sample variability to enhance multivariate analysis of microarray data
MOTIVATION Biological and technical variability is intrinsic in any microarray experiment. While most approaches aim to account for this variability, they do not actively exploit it. Here, we consider a novel approach that uses the variability between arrays to provide an extra source of information that can enhance gene expression analyses. RESULTS We develop a method that uses sample simila...
متن کاملA Self-supervised Learning Framework for Classifying Microarray Gene Expression Data
It is important to develop computational methods that can effectively resolve two intrinsic problems in microarray data: high dimensionality and small sample size. In this paper, we propose a self-supervised learning framework for classifying microarray gene expression data using Kernel Discriminant-EM (KDEM) algorithm. This framework applies self-supervised learning techniques in an optimal no...
متن کاملIntegration and Reduction of Microarray Gene Expressions Using an Information Theory Approach
The DNA microarray is an important technique that allows researchers to analyze many gene expression data in parallel. Although the data can be more significant if they come out of separate experiments, one of the most challenging phases in the microarray context is the integration of separate expression level datasets that have gathered through different techniques. In this paper, we prese...
متن کامل